PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa20g081840.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 362aa    MW: 40816 Da    PI: 9.5012
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa20g081840.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox56.64.5e-18203257256
                     T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     rk+ +++k+q   Lee F+++++++ +++  LAkkl+Lt rqV vWFqNrRa+ k
  Csa20g081840.1 203 RKKLRLSKDQSAFLEETFKEHNTLNPKQKLALAKKLNLTARQVEVWFQNRRARTK 257
                     788899***********************************************98 PP

2HD-ZIP_I/II119.71.5e-38203292191
     HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreel 91 
                     +kk+rlsk+q+++LEe+F+e+++L+p++K +la++L+l++rqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l+een+rL+ke  eLr +l
  Csa20g081840.1 203 RKKLRLSKDQSAFLEETFKEHNTLNPKQKLALAKKLNLTARQVEVWFQNRRARTKLKQTEVDCEYLKRCVEKLTEENRRLQKEAMELR-TL 292
                     69*************************************************************************************9.44 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF046185.1E-2280169IPR006712HD-ZIP protein, N-terminal
Gene3DG3DSA:1.10.10.602.2E-18183260IPR009057Homeodomain-like
SuperFamilySSF466891.24E-18186260IPR009057Homeodomain-like
PROSITE profilePS5007117.313199259IPR001356Homeobox domain
SMARTSM003896.9E-16201263IPR001356Homeobox domain
CDDcd000869.06E-16203260No hitNo description
PfamPF000461.5E-15203257IPR001356Homeobox domain
PROSITE patternPS000270234257IPR017970Homeobox, conserved site
PfamPF021835.7E-8259292IPR003106Leucine zipper, homeobox-associated
SMARTSM003402.5E-23259302IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 362 aa     Download sequence    Send to blast
MSHLTKSHHS KYYINPPLSS SSIPHISLLS PTTKGSIDHL LQVETTHLYH LSGYFLSLTQ  60
KKKKIKPFFF FKLLSSLRMM MGKEDLGLSL SLGSSQNHNP LQLNLNHNAS LSNNLQRFPW  120
NQTFDHTSDL RKIDVNSFPS TANCEEETGV SSPNSTISST ISGKRSEREG ISGASDDHDE  180
ITPDRGYSRG TSDEDDDGGE TSRKKLRLSK DQSAFLEETF KEHNTLNPKQ KLALAKKLNL  240
TARQVEVWFQ NRRARTKLKQ TEVDCEYLKR CVEKLTEENR RLQKEAMELR TLKLSPQFYG  300
QITPPTTLIM CPSCERVAGP SPSSSNHHHH QNHRPVSINP WVACAGQVAH GLNFEALRPR  360
S*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1201207SRKKLRL
2251259RRARTKLKQ
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAF4284500.0AF428450.1 Arabidopsis thaliana AT5g47370/MQL5_23 mRNA, complete cds.
GenBankAJ4311830.0AJ431183.1 Arabidopsis thaliana mRNA for homeodomain-leucine zipper protein HAT2 (hat2 gene).
GenBankAY0523240.0AY052324.1 Arabidopsis thaliana AT5g47370/MQL5_23 mRNA, complete cds.
GenBankAY0619040.0AY061904.1 Arabidopsis thaliana AT5g47370/MQL5_23 mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010494967.10.0PREDICTED: homeobox-leucine zipper protein HAT2-like
SwissprotP466010.0HAT2_ARATH; Homeobox-leucine zipper protein HAT2
TrEMBLD7MPU40.0D7MPU4_ARALL; Putative uncharacterized protein
STRINGAT5G47370.10.0(Arabidopsis thaliana)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM15632690
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G47370.11e-127Homeobox-leucine zipper protein 4 (HB-4) / HD-ZIP protein